-
Notifications
You must be signed in to change notification settings - Fork 18.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Splitting source files between CUDA and CPU code. #172
Conversation
@erictzeng congrats, it compiles, build and pass the tests in OSX 10.7 |
@erictzeng Something weird happened, the second time multinomial_logistic_loss_layer test failed. This could be a problem of the test itself @Yangqing?
|
@sguada it's just a precision error. This test was relaxed in the Note that this does not include the MKL/non-MKL merge yet, which is what is broken on OSX. However, we expect the combination of this and #165 to fix #122. |
@shelhamer I think 0.02 is quite big for a precision error. |
I'm quoting a previous conversation with @Yangqing, who said it was safe to ignore. Thank you for bringing up the question–we should aim for better test documentation. |
Alex also said it was safe to ignore. |
The multinomial loss precision should be fine :) Splitting this way should be fine too. Yangqing On Wed, Feb 26, 2014 at 7:29 PM, Lin Min [email protected] wrote:
|
@erictzeng let's figure out #165 now. I'm doing a rebase, then we can check what's what. |
@shelhamer @mavenlin thanks for the pointers. I think test should consistently pass or fail, that's part of the automatic testing. So please adjust the epsilon if it needs to be adjusted. |
Thanks Evan, I would think about how to test the mat wrapper as mentioned in #173 quickly |
I'm wondering: has there been work done to compile it without the cuda parts? From quickly skimming the changes(and correct me if I'm wrong), it looks like the classes are split up, but both CPU and the GPU code need to be compiled. In any case, this is of course a very useful first step for building a CPU only version. |
You're right in that as of right now, CPU and GPU still have to be compiled together. Clean separation of the two so that Caffe can be compiled without GPU is one of our short-term goals, though, so consider this as just step 1 in an ongoing process! |
Added new GaussianStatic dummy layer
fix the bug in input reshape of cuDNN
Addresses #152. Tests pass.
@Yangqing, I modeled this after some of the layer files that were already split between .cpp and .cu, but I'd appreciate a sanity check just to ensure that this way of splitting things isn't going to cause much pain and misery in the future. :)